Add min_pool_size, Add default value of should_shuffle #70
DataProvider* DataProvider::create(const DataConfig& config,
                                   const ModelConfig& modelConfig,
                                   bool useGpu) {
  return registrar_.createByType(config.type(), config, modelConfig, useGpu);
Add ModelConfig to DataProvider::create to get the input layer order.
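The idea behind passing the model configuration can be sketched in Python: if a provider yields each sample as a dict keyed by input layer name, the fields can be rearranged into the order the model declares its inputs. This is only an illustration under that assumption; `reorder_by_input_names` and the layer names `pixel` and `label` are hypothetical and not part of Paddle's API.

```python
def reorder_by_input_names(sample, input_order):
    """Return the sample's fields as a list in the model's input layer order.

    `sample` is a dict keyed by input layer name and `input_order` is the list
    of layer names in the order the model declares them. Both are illustrative
    assumptions, not Paddle data structures.
    """
    missing = [name for name in input_order if name not in sample]
    if missing:
        raise KeyError("sample is missing input(s): %s" % ", ".join(missing))
    return [sample[name] for name in input_order]


# Hypothetical sample whose dict order differs from the model's input order.
sample = {"label": 1, "pixel": [0.0, 0.5, 1.0]}
print(reorder_by_input_names(sample, ["pixel", "label"]))
```

Failing loudly on a missing name keeps a misnamed input from silently feeding the wrong layer.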
return dp;\
});\
})
Add more comments.
Also, please update the data provider documentation.
@emailweixu Updated the code and added the Chinese docs. The English documentation will be added ASAP.
Need to fix the test.
@emailweixu The earlier unit test error occurred because we didn't disable shuffling in the unit tests, and this patch set the [...]. Also added the English documentation.
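A rough Python sketch of how such a default could be resolved, assuming "not set by the user" is represented as `None`. The helper name `resolve_should_shuffle` and the `is_train` flag are hypothetical, chosen only for illustration:

```python
def resolve_should_shuffle(should_shuffle, is_train):
    """Pick the effective shuffle flag.

    None is assumed to mean "not set by the user": shuffle when training,
    keep the original order when testing. An explicit value always wins.
    """
    if should_shuffle is None:
        return is_train
    return bool(should_shuffle)


# A unit test that compares against a fixed sample order has to opt out.
print(resolve_should_shuffle(None, is_train=True))
print(resolve_should_shuffle(False, is_train=True))
```

With a default like this, a test that runs in training mode and expects a fixed sample order needs to pass `should_shuffle=False` explicitly.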
* cache is a data cache strategy, see `cache`_.
* Init_hook function is invoked once the data provider is initialized,
  see `init_hook`_.

.. autofunction:: paddle.trainer.PyDataProvider2.provider
Here, we use the docstring of paddle.trainer.PyDataProvider2.provider as the documentation.
* min_pool_size is infinite by default.
* Add a unit test for min_pool_size.
* Fix a bug in can_over_batch_size.
* Add a unit test for can_over_batch_size.
* Add DEFINE_PROVIDER_EX.
* Add a default value for should_shuffle.
* When training, the default value of should_shuffle is True.
* When testing, the default value of should_shuffle is False.
* Users can set whether a provider shuffles by passing should_shuffle to `@provider`.
* should_shuffle can handle a list of values, not just a boolean.
* Add input order mapping by name.
* Add a unit test.
* Add a check of the input format.
* The check is off by default for speed reasons.
* When the check fails, the user can stop training or continue training without the failing sample.
* Use deque instead of vector in the generator pool, which makes erasing a generator faster.
* Add Chinese/English documentation.
* Set should_shuffle = false in the unit tests.
* Add Python files to depends.
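To make the interaction between min_pool_size and should_shuffle concrete, here is a minimal Python sketch. It assumes min_pool_size means "keep at least this many samples buffered before emitting one", with `None` standing in for the infinite default. The function `pooled_samples` is hypothetical and is not Paddle's implementation.

```python
import random


def pooled_samples(generator, min_pool_size=None, should_shuffle=True, rng=None):
    """Yield samples from `generator` through a buffer pool.

    With min_pool_size=None (the "infinite" case) every sample is buffered
    first, so a shuffle covers the whole data set. With a finite value, a
    sample is emitted whenever the pool grows past min_pool_size, so memory
    stays bounded and shuffling is only local.
    """
    rng = rng or random.Random()
    pool = []
    for sample in generator:
        pool.append(sample)
        if min_pool_size is not None and len(pool) > min_pool_size:
            # Pick a random buffered sample, or the oldest one if not shuffling.
            idx = rng.randrange(len(pool)) if should_shuffle else 0
            yield pool.pop(idx)
    # Input exhausted: drain whatever is still buffered.
    if should_shuffle:
        rng.shuffle(pool)
    for sample in pool:
        yield sample


# Without shuffling, the original order is preserved.
print(list(pooled_samples(iter(range(5)), min_pool_size=2, should_shuffle=False)))
```

The trade-off in this sketch is that a smaller pool uses less memory but mixes samples only within a sliding window, while the infinite default gives a full shuffle at the cost of buffering everything.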